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12.1 Hypothesis Testing of Single Mean and Single Proportion: 
Introduction TCC 


Student Learning Outcomes 
By the end of this chapter, the student should be able to: 


¢ Differentiate between Type I and Type II Errors 

e Describe hypothesis testing in general and in practice 

¢ Conduct and interpret hypothesis tests for a single population mean, 
population standard deviation known. 

¢ Conduct and interpret hypothesis tests for a single population mean, 
population standard deviation unknown. 

¢ Conduct and interpret hypothesis tests for a single population 
proportion. 


Introduction 


One job of a Statistician is to make statistical inferences about populations 
based on samples taken from the population. Confidence intervals are one 
way to estimate a population parameter. Another way to make a Statistical 
inference is to make a decision about a parameter. For instance, a car dealer 
advertises that its new small truck gets 35 miles per gallon, on the average. 
A tutoring service claims that its method of tutoring helps 90% of its 
students get an A or a B. A company says that women managers in their 
company earn an average of $60,000 per year. 


A Statistician will make a decision about these claims. This process is called 
"hypothesis testing." A hypothesis test involves collecting data from a 
sample and evaluating the data. Then, the statistician makes a decision as to 
whether or not there is sufficient evidence based upon analyses of the data, 
to reject the null hypothesis. 


In this chapter, you will conduct hypothesis tests on single means and single 
proportions. You will also learn about the errors associated with these tests. 


Hypothesis testing consists of two contradictory hypotheses or statements, a 
decision based on the data, and a conclusion. To perform a hypothesis test, a 
Statistician will: 


1. Set up two contradictory hypotheses. 

2. Collect sample data (in homework problems, the data or summary 
Statistics will be given to you). 

3. Determine the correct distribution to perform the hypothesis test. 

4. Analyze sample data by performing the calculations that ultimately 
will allow you to reject or fail to reject the null hypothesis. 

5. Make a decision and write a meaningful conclusion. 


Note:To do the hypothesis test homework problems for this chapter and 
later chapters, make copies of the appropriate special solution sheets. See 
the Table of Contents topic "Solution Sheets”. 


Glossary 


Confidence Interval (CI) 
An interval estimate for an unknown population parameter. This 


depends on: 


e The desired confidence level. 

e Information that is known about the distribution (for example, 
known standard deviation). 

e The sample and its size. 


Hypothesis Testing 
Based on sample evidence, a procedure to determine whether the 
hypothesis stated is a reasonable statement and cannot be rejected, or 
is unreasonable and should be rejected. 


12.2 Hypothesis Testing of Single Mean and Single Proportion: Null and 
Alternate Hypotheses TCC 


The actual test begins by considering two hypotheses. They are called the 
null hypothesis and the alternate hypothesis. These hypotheses contain 
opposing viewpoints. 


H,: The null hypothesis: It is a statement about the population that will be 
assumed to be true unless it can be shown to be incorrect beyond a 
reasonable doubt. 


H,: The alternate hypothesis: It is a claim about the population that is 
contradictory to H, and what we conclude when we reject Ho. 


Example: 

H,: 30% of the registered voters in Santa Clara County voted in the 
primary election. 

Hf,: More than 30% of the registered voters in Santa Clara County voted in 
the primary election. 


Example: 

We want to test whether the mean grade point average in American 
colleges is different from 2.0 (out of 4.0). 

Vale yt, Ae ps 


Example: 

We want to test if college students take less than five years to graduate 
from college, on the average. 

Vay Ao 


Example: 

In an issue of U. S. News and World Report, an article on school 
standards stated that about half of all students in France, Germany, and 
Israel take advanced placement exams and a third pass. The same article 
stated that 6.6% of U. S. students take advanced placement exams and 4.4 
% pass. Test if the percentage of U. S. students who take advanced 
placement exams is more than 6.6%. 

fae eyg 1 Pe) 


Since the null and alternate hypotheses are contradictory, you must examine 
evidence to decide if you have enough evidence to reject the null hypothesis 
or not. The evidence is in the form of sample data. 


After you have determined which hypothesis the sample supports, you 
make a decision. There are two options for a decision. They are "reject H," 
if the sample information favors the alternate hypothesis or "do not reject 
Hf," or "fail to reject H," if the sample information is insufficient to reject 
the null hypothesis. 


Mathematical Symbols Used in H, and A: 


A, A, 


equal (_ ) not equal( )orgreaterthan( )orlessthan(_ ) 


Note: H, always has and equals symbol in it. H, never has a symbol with 
an equal in it. The choice of symbol depends on the wording of the 
hypothesis test. However, be aware that many researchers (including one of 


the co-authors in research work) use symbols other than in the Null 
Hypothesis, even with or asthe symbol in the Alternate Hypothesis. 
This practice is acceptable because we only make the decision to reject or 
not reject the Null Hypothesis. 


Optional Collaborative Classroom Activity 


Bring to class a newspaper, some news magazines, and some Internet 
articles . In groups, find articles from which your group can write a null and 
alternate hypotheses. Discuss your hypotheses with the rest of the class. 


Glossary 


Hypothesis 
A statement about the value of a population parameter. In case of two 
hypotheses, the statement assumed to be true is called the null 
hypothesis (notation H ) and the contradictory statement is called the 
alternate hypothesis (notation H,). 


12.3 Hypothesis Testing of Single Mean and Single Proportion: Outcomes 
and the Type I and Type II Errors TCC 


When you perform a hypothesis test, there are four possible outcomes 
depending on the actual truth (or falseness) of the null hypothesis H, and 
the decision to reject or not. The outcomes are summarized in the following 
table: 


ACTION H, 1S ACTUALLY 

True False 
Do not reject H, Correct Outcome Type II error 
Reject H, Type I Error Correct Outcome 


The four possible outcomes in the table are: 


e The decision is to not reject H, when, in fact, H, is true (correct 
decision). 

e The decision is to reject H, when, in fact, H, is true (incorrect 
decision known as a Type I error). 

e The decision is to not reject H, when, in fact, H, is false (incorrect 
decision known as a Type I error). 

e The decision is to reject H, when, in fact, H, is false (correct 
decision whose probability is called the Power of the Test). 


Each of the errors occurs with a particular probability. The Greek letters a 
and @ represent the probabilities. 


qa = probability of a Type I error = P(Type I error) = probability of 
rejecting the null hypothesis when the null hypothesis is true. 


@ = probability of a Type II error = P(Type II error) = probability of not 
rejecting the null hypothesis when the null hypothesis is false. 


a and £ should be as small as possible because they are probabilities of 
errors. They are rarely 0. 


The Power of the Test is 1 — @. Ideally, we want a high power that is as 
close to 1 as possible. Increasing the sample size can increase the Power of 
the Test. 


The following are examples of Type I and Type II errors. 


Example: 

Suppose the null hypothesis, Ho, is: Frank's rock climbing equipment is 
safe. 

Type I error: Frank thinks that his rock climbing equipment may not be 
safe when, in fact, it really is safe. Type II error: Frank thinks that his 
rock climbing equipment may be safe when, in fact, it is not safe. 

qa = probability that Frank thinks his rock climbing equipment may not be 
safe when, in fact, it really is safe. 6 = probability that Frank thinks his 
rock climbing equipment may be safe when, in fact, it is not safe. 

Notice that, in this case, the error with the greater consequence is the Type 
II error. (If Frank thinks his rock climbing equipment is safe, he will go 
ahead and use it.) 


Example: 

Suppose the null hypothesis, H,, is: The victim of an automobile accident 
is alive when he arrives at the emergency room of a hospital. 

Type I error: The emergency crew thinks that the victim is dead when, in 
fact, the victim is alive. Type II error: The emergency crew does not 
know if the victim is alive when, in fact, the victim is dead. 

qa = probability that the emergency crew thinks the victim is dead when, 
in fact, he is really alive = P(Type I error). G = probability that the 


emergency crew does not know if the victim is alive when, in fact, the 
victim is dead = P(Type IJ error). 

The error with the greater consequence is the Type I error. (If the 
emergency crew thinks the victim is dead, they will not treat him.) 


Glossary 


Type 1 Error 
The decision is to reject the Null hypothesis when, in fact, the Null 


hypothesis is true. 


Type 2 Error 
The decision is to not reject the Null hypothesis when, in fact, the Null 


hypothesis is false. 


12.4 Hypothesis Testing of Single Mean and Single Proportion: Distribution 
Needed for Hypothesis Testing TCC 


Earlier in the course, we discussed sampling distributions. Particular 
distributions are associated with hypothesis testing. Perform tests of a 
population mean using a normal distribution or a student's-t 
distribution. (Remember, use a student's-t distribution when the population 
standard deviation is unknown and the distribution of the sample mean is 
approximately normal.) In this chapter we perform tests of a population 
proportion using a normal distribution (usually n is large or the sample size 
is large). 


If you are testing a single population mean, the distribution for the test is 
for means: 


X~ N(x 2x) or t 


The population parameter is jz. The estimated value (point estimate) for pu is 
x, the sample mean. 


If you are testing a single population proportion, the distribution for the 
test is for proportions or percentages: 


~N pyf2e 


The population parameter is p. The estimated value (point estimate) for p is 
= where « is the number of successes and n is the sample 


size. 


Glossary 


Normal Distribution 
A continuous random variable (RV) with pdf 


e * # © | where p is the mean of the distribution and 
oO vi) 


o is the standard deviation. Notation: X~ N po . If u and 
oO , the RV is called the standard normal distribution. 


Standard Deviation 
A number that is equal to the square root of the variance and measures 
how far data values are from their mean. Notation: s for sample 
standard deviation and o for population standard deviation. 


Student's-t Distribution 
Investigated and reported by William S. Gossett in 1908 and published 
under the pseudonym Student. The major characteristics of the random 
variable (RV) are: 


e It is continuous and assumes any real values. 

e The pdf is symmetrical about its mean of zero. However, it is 
more spread out and flatter at the apex than the normal 
distribution. 

e It approaches the standard normal distribution as n gets larger. 

e There is a "family" of t distributions: every representative of the 
family is completely defined by the number of degrees of 
freedom which is one less than the number of data. 


12.5 Hypothesis Testing of Single Mean and Single Proportion: 
Assumptions TCC 


When you perform a hypothesis test of a single population mean jy using 
a Student's-t distribution (often called a t-test), there are fundamental 
assumptions that need to be met in order for the test to work properly. Your 
data should be a simple random sample that comes from a population that 
is approximately normally distributed. You use the sample standard 
deviation to approximate the population standard deviation. (Note that if 
the sample size is sufficiently large, a t-test will work even if the population 
is not approximately normally distributed). 


When you perform a hypothesis test of a single population mean jj using 
a normal distribution (often called a z-test), you take a simple random 
sample from the population. The population you are testing is normally 
distributed or your sample size is sufficiently large, usually 30 and above. 
You know the value of the population standard deviation. 


When you perform a hypothesis test of a single population proportion p, 
you take a simple random sample from the population. You must meet the 
conditions for a binomial distribution which are there are a certain number 
n of independent trials, the outcomes of any trial are success or failure, and 
each trial has the same probability of a success p. The shape of the binomial 
distribution needs to be similar to the shape of the normal distribution. To 
ensure this, the quantities mp and n(1-p) must both be greater than ten ( 

np > 10 and n(1-p) > 10). Then the binomial distribution of sample 
(estimated) proportion can be approximated by the normal distribution with 


p= pando = 1 Zoe), 


Glossary 


Binomial Distribution 
A discrete random variable (RV) which arises from Bernoulli trials. 
There are a fixed number, n, of independent trials. “Independent” 
means that the result of any trial (for example, trial 1) does not affect 
the results of the following trials, and all trials are conducted under the 


Same conditions. Under these circumstances the binomial RV_X is 
defined as the number of successes in n trials. The notation is: X~ 
B(n, p). The mean is = np and the standard deviation is o = ,/npq 
. The probability of exactly x successes in 7 trials is 

P= a = 3 pg, 


Normal Distribution 
A continuous random variable (RV) with pdf 


x) = = e~(*-H)"/20° | where js is the mean of the distribution and 


o is the standard deviation. Notation: X ~ N(y, 0). If uw = 0 and 
ao = 1, the RV is called the standard normal distribution. 


Standard Deviation 
A number that is equal to the square root of the variance and measures 
how far data values are from their mean. Notation: s for sample 
standard deviation and o for population standard deviation. 


Student-t Distribution 
Investigated and reported by William S. Gossett in 1908 and published 
under the pseudonym Student. The major characteristics of the random 
variable (RV) are: 


e It is continuous and assumes any real values. 

e The pdf is symmetrical about its mean of zero. However, it is 
more spread out and flatter at the apex than the normal 
distribution. 

e It approaches the standard normal distribution as n gets larger. 

e There is a "family" of t distributions: every representative of the 
family is completely defined by the number of degrees of 
freedom which is one less than the number of data. 


12.6 Hypothesis Testing of Single Mean and Single Proportion: Using the 
Sample to Test the Null Hypothesis TCC 


Use the sample data to calculate the actual probability of getting the test 
result, called the p-value. The p-value is the probability that, if the null 
hypothesis is true, the results from another randomly selected sample 
will be as extreme or more extreme as the results obtained from the 
given sample. 


A large p-value calculated from the data indicates that we should fail to 
reject the null hypothesis. The smaller the p-value, the more unlikely the 
outcome, and the stronger the evidence is against the null hypothesis. We 
would reject the null hypothesis if the evidence is strongly against it. 


Draw a graph that shows the p-value. The hypothesis test is easier to 
perform if you use a graph because you see the problem more clearly. 


Example: 

(to illustrate the p-value) 

Suppose a baker claims that his bread height is more than 15 cm, on the 
average. Several of his customers do not believe him. To persuade his 
customers that he is right, the baker decides to do a hypothesis test. He 
bakes 10 loaves of bread. The mean height of the sample loaves is 17 cm. 
The baker knows from baking hundreds of loaves of bread that the 
standard deviation for the height is 0.5 cm. and the distribution of heights 


is normal. 

The null hypothesis could be H,: yu The alternate hypothesis is H,: 
LL 

The words "is more than" translates asa" "so "pu " goes into the 


alternate hypothesis. The null hypothesis must contradict the alternate 
hypothesis. 

Since o is known (o cm.), the distribution for the population is 
known to be normal with mean jz and standard deviation = 


Suppose the null hypothesis is true (the mean height of the loaves is no 
more than 15 cm). Then is the mean height (17 cm) calculated from the 
sample unexpectedly large? The hypothesis test works by asking the 
question how unlikely the sample mean would be if the null hypothesis 
were true. The graph shows how far out the sample mean is on the normal 
curve. The p-value is the probability that, if we were to take other samples, 
any other sample mean would fall at least as far out as 17 cm. 

The p-value, then, is the probability that a sample mean is the same or 
greater than 17 cm. when the population mean is, in fact, 15 cm. We 
can calculate this probability using the normal distribution for means from 
Chapter 7. 


p-value is 
approximately 0 


15 17 


Pee which is approximately 0. 
A p-value of approximately 0 tells us that it is highly unlikely that a loaf of 
bread rises no more than 15 cm, on the average. That is, almost 0% of all 
loaves of bread would be at least as high as 17 cm. purely by CHANCE 
had the population mean height really been 15 cm. Because the outcome of 
17 cm. is so unlikely (meaning it is happening NOT by chance alone), 
we conclude that the evidence is strongly against the null hypothesis (the 
mean height is at most 15 cm.). There is sufficient evidence that the true 
mean height for the population of the baker's loaves of bread is greater than 
Seen 


Glossary 


Hypothesis 
A statement about the value of a population parameter. In case of two 
hypotheses, the statement assumed to be true is called the null 


hypothesis (notation H ) and the contradictory statement is called the 
alternate hypothesis (notation H,). 


p-value 
The probability that an event will happen purely by chance assuming 
the null hypothesis is true. The smaller the p-value, the stronger the 
evidence is against the null hypothesis. 


Standard Deviation 
A number that is equal to the square root of the variance and measures 
how far data values are from their mean. Notation: s for sample 
standard deviation and o for population standard deviation. 


12.7 Hypothesis Testing of Single Mean and Single Proportion: Decision 
and Conclusion TCC 


A systematic way to make a decision of whether to reject or not reject the 
null hypothesis is to compare the p-value and a preset or preconceived a 
(also called a "significance level"). A preset a is the probability of a Type 
I error (rejecting the null hypothesis when the null hypothesis is true). It 
may or may not be given to you at the beginning of the problem. 


When you make a decision to reject or not reject Ho, do as follows: 


« Ifa , reject H,. The results of the sample data are 
significant. There is sufficient evidence to conclude that H, is an 
incorrect belief and that the alternative hypothesis, H,, may be 
correct. 

° Ifa , do not reject H,. The results of the sample data are 
not significant. There is not sufficient evidence to conclude that the 
alternative hypothesis, H,, may be correct. 

e When you "do not reject H,", it does not mean that you should believe 
that H, is true. It simply means that the sample data have failed to 
provide sufficient evidence to cast serious doubt about the truthfulness 
offi 5. 


Conclusion: After you make your decision, write a thoughtful conclusion 
about the hypotheses in terms of the given problem. 


Glossary 


Hypothesis 
A statement about the value of a population parameter. In case of two 
hypotheses, the statement assumed to be true is called the null 
hypothesis (notation 7 ) and the contradictory statement is called the 
alternate hypothesis (notation H,). 


Level of Significance of the Test 
Probability of a Type I error (reject the null hypothesis when it is true). 
Notation: a. In hypothesis testing, the Level of Significance is called 


the preconceived a or the preset a. 


p-value 
The probability that an event will happen purely by chance assuming 
the null hypothesis is true. The smaller the p-value, the stronger the 
evidence is against the null hypothesis. 


Type 1 Error 
The decision is to reject the Null hypothesis when, in fact, the Null 
hypothesis is true. 


12.8 Hypothesis Testing of Single Mean and Single Proportion: Additional 
Information 


In a hypothesis test problem, you may see words such as "the level of 
significance is 1%." The "1%" is the preconceived or preset a. 

The statistician setting up the hypothesis test selects the value of a@ to 
use before collecting the sample data. 

If no level of significance is given, the accepted standard is to use 
a = 0.05. 

When you calculate the p-value and draw the picture, the p-value is 
the area in the left tail, the right tail, or split evenly between the two 
tails. For this reason, we call the hypothesis test left, right, or two 
tailed. 

The alternate hypothesis, H,, tells you if the test is left, right, or two- 
tailed. It is the key to conducting the appropriate test. 

HA, never has a symbol that contains an equal sign. 

Thinking about the meaning of the p-value: A data analyst (and 
anyone else) should have more confidence that he made the correct 
decision to reject the null hypothesis with a smaller p-value (for 
example, 0.001 as opposed to 0.04) even if using the 0.05 level for 
alpha. Similarly, for a large p-value like 0.4, as opposed to a p-value of 
0.056 (alpha = 0.05 is less than either number), a data analyst should 
have more confidence that she made the correct decision in failing to 
reject the null hypothesis. This makes the data analyst use judgment 
rather than mindlessly applying rules. 


The following examples illustrate a left, right, and two-tailed test. 


Example: 

donb 0) lal eeUh <5) 

Test of a single population mean. H, tells you the test is left-tailed. The 
picture of the p-value is as follows: 


p-value 


4 | 


Example: 

Hes eed pe Wee 

This is a test of a single population proportion. H, tells you the test is 
right-tailed. The picture of the p-value is as follows: 


p-value 


0.2 


Example: 

a: ff = 50 Ha? jb 50 

This is a test of a single population mean. H, tells you the test is two- 
tailed. The picture of the p-value is as follows. 


l l 
— (p-value — p-value) 
5 (p-value) 5 | 


4 | 


50 


Glossary 


Hypothesis Testing 
Based on sample evidence, a procedure to determine whether the 
hypothesis stated is a reasonable statement and cannot be rejected, or 
is unreasonable and should be rejected. 


p-value 
The probability that an event will happen purely by chance assuming 
the null hypothesis is true. The smaller the p-value, the stronger the 
evidence is against the null hypothesis. 


12.9 Hypothesis Testing of Single Mean and Single Proportion: Summary 
of the Hypothesis Test TCC 


The hypothesis test itself has an established process. This can be 
summarized as follows: 


1. Determine and _. Remember, they are contradictory. 

2. Determine the distribution for the test. 

3. Draw a graph, calculate the test statistic, and use the test statistic to 
calculate the p-value. (A z-score and a t-score are examples of test 
Statistics.) 

4. Compare the preconceived with the p-value, make a decision (reject 
or do not reject _+), and write a clear conclusion using English 


sentences. 
Notice that in performing the hypothesis test, you use andnot ._ is 
needed to help determine the sample size of the data that is used in 
calculating the p-value. Remember that the quantity is called the 


Power of the Test. A high power is desirable. If the power is too low, 
Statisticians typically increase the sample size while keeping the same. If 
the power is low, the null hypothesis might not be rejected when it should 
be. 


Glossary 


Hypothesis Testing 
Based on sample evidence, a procedure to determine whether the 
hypothesis stated is a reasonable statement and cannot be rejected, or 
is unreasonable and should be rejected. 


p-value 
The probability that an event will happen purely by chance assuming 
the null hypothesis is true. The smaller the p-value, the stronger the 
evidence is against the null hypothesis. 


12.10 Hypothesis Testing of Single Mean and Single Proportion: Examples 
TCC 

This module provides examples of Hypothesis Testing of a Single Mean 
and a Single Proportion as a part of the Collaborative Statistics collection 
(col10522) by Barbara Illowsky and Susan Dean. 


Example: 
Exercise: 


Problem: 


Jeffrey, as an eight-year old, established a mean time of 16.43 
seconds for swimming the 25-yard freestyle, with a standard 
deviation of 0.8 seconds. His dad, Frank, thought that Jeffrey could 
swim the 25-yard freestyle faster by using goggles. Frank bought 
Jeffrey a new pair of expensive goggles and timed Jeffrey for 15 25- 
yard freestyle swims. For the 15 swims, Jeffrey's mean time was 16 
seconds. Frank thought that the goggles helped Jeffrey to swim 
faster than the 16.43 seconds. Conduct a hypothesis test using a 
preset a = 0.05. Assume that the swim times for the 25-yard 
freestyle are normal. 


Solution: 
Set up the Hypothesis Test: 


Since the problem is about a mean, this is a test of a single 
population mean. 


Hoiu=16.43 Haw < 16.43 


For Jeffrey to swim faster, his time will be less than 16.43 seconds. 
The "<" tells you this is left-tailed. 


Determine the distribution needed: 


Random variable: X = the mean time to swim the 25-yard freestyle. 


Distribution for the test: X is normal (population standard 
deviation is known: o = 0.8) 


ee N(n, x) Therefore, X ~ N (16.43, 8. 


jt = 16.43 comes from AH and not the data. o = 0.8, andn = 15. 
Calculate the p-value using the normal distribution for a mean: 
p-value = P(a << 16) = 0.0187 where the sample mean in the 
problem is given as 16. 


p-value = 0.0187 (This is called the actual level of significance.) 
The p-value is the area to the left of the sample mean is given as 16. 


Graph: 
p-value - 
x= 16 
nw = 16.43 
16 16.43 ss 


jt = 16.43 comes from H,. Our assumption is wy = 16.43. 
Interpretation of the p-value: If H, is true, there is a 0.0187 
probability (1.87%) that Jeffrey's mean time to swim the 25-yard 
freestyle is 16 seconds or less. Because a 1.87% chance is small, the 


mean time of 16 seconds or less is unlikely to have happened 
randomly. It is a rare event. 


Compare q@ and the p-value: 
a = 0.05 p-value = 0.0187 a > p-value 


Make a decision: Since a > p-value, reject Ho. 


This means that you reject 44 = 16.43. In other words, you do not 
think Jeffrey swims the 25-yard freestyle in 16.43 seconds but faster 
with the new goggles. 


Conclusion: At the 5% significance level, we conclude that Jeffrey 
swims faster using the new goggles. The sample data show there is 
sufficient evidence that Jeffrey's mean time to swim the 25-yard 
freestyle is less than 16.43 seconds. 


The p-value can easily be calculated using the TI-83+ and the TI-84 
calculators: 


Press STAT and arrow over to TESTS. Press 1:Z-Test. Arrow over 
to Stats and press ENTER. Arrow down and enter 16.43 for fp (null 
hypothesis), .8 for 0, 16 for the sample mean, and 15 for n. Arrow 
down to p: (alternate hypothesis) and arrow over to <fg. Press 
ENTER. Arrow down to Calculate and press ENTER. The 
calculator not only calculates the p-value (p = 0.0187) but it also 
calculates the test statistic (z-score) for the sample mean.  < 16.43 
is the alternate hypothesis. Do this set of instructions again except 
arrow to Draw (instead of Calculate). Press ENTER. A shaded 
graph appears with z = —2.08 (test statistic) and p = 0.0187 (p- 
value). Make sure when you use Dr aw that no other equations are 
highlighted in Y = and the plots are turned off. 


When the calculator does a Z-Test, the Z- Test function finds the p- 
value by doing a normal probability calculation using the Central 
Limit Theorem: 


2 -op—2nd DISTR normcdf 
(—10 WHA 16.43, 0.8/V15). 
The Type I and Type IJ errors for this problem are as follows: 


The Type I error is to conclude that Jeffrey swims the 25-yard 
freestyle, on average, in less than 16.43 seconds when, in fact, he 


actually swims the 25-yard freestyle, on average, in 16.43 seconds. 
(Reject the null hypothesis when the null hypothesis is true.) 


The Type II error is that there is not evidence to conclude that Jeffrey 
swims the 25-yard free-style, on average, in less than 16.43 seconds 
when, in fact, he actually does swim the 25-yard free-style, on 
average, in less than 16.43 seconds. (Do not reject the null hypothesis 
when the null hypothesis is false.) 


Historical Note: The traditional way to compare the two probabilities, a 
and the p-value, is to compare the critical value (z-score from q) to the test 
Statistic (z-score from data). The calculated test statistic for the p-value is 
—2.08. (From the Central Limit Theorem, the test statistic formula is 


(=) . For this problem, x = 16, wx = 16.43 from the null 
hypothesis, ox = 0.8, and m = 15.) You can find the critical value for 

a = 0.05 in the normal table (see 15.Tables in the Table of Contents). The 
z-score for an area to the left equal to 0.05 is midway between -1.65 and 
-1.64 (0.05 is midway between 0.0505 and 0.0495). The z-score is -1.645. 
Since —1.645 > —2.08 (which demonstrates that @ > p-value), reject Ho. 
Traditionally, the decision to reject or not reject was done in this way. 
Today, comparing the two probabilities a and the p-value is very common. 
For this problem, the p-value, 0.0187 is considerably smaller than a, 0.05. 
You can be confident about your decision to reject. The graph shows a, the 
p-value, and the test statistics and the critical value. 


— 


p-value = 0.0187 


-2.08 -1.645 0 


Example: 
Exercise: 


Problem: 


A college football coach thought that his players could bench press a 
mean weight of 275 pounds. It is known that the standard deviation 
is 55 pounds. Three of his players thought that the mean weight was 
more than that amount. They asked 30 of their teammates for their 
estimated maximum lift on the bench press exercise. The data ranged 
from 205 pounds to 385 pounds. The actual different weights were 
(frequencies are in parentheses) 205(3) 215(3) 225(1) 241(2) 252(2) 
265(2) 275(2) 313(2) 316(5) 338(2) 341(1) 345(2) 368(2) 385(1). 
(Source: data from Reuben Davis, Kraig Evans, and Scott 
Gunderson. ) 


Conduct a hypothesis test using a 2.5% level of significance to 
determine if the bench press mean is more than 275 pounds. 


Solution: 
Set up the Hypothesis Test: 


Since the problem is about a mean weight, this is a test of a single 
population mean. 


Hi — 2 J aT eT hS) This is a right-tailed test. 
Calculating the distribution needed: 


Random variable: X = the mean weight, in pounds, lifted by the 
football players. 


Distribution for the test: It is normal because o is known. 


z 55 
Kaen) (275, $5.) 


x = 286.2 pounds (from the data). 


o = 55 pounds (Always use o if you know it.) We assume fz = 275 
pounds unless our data shows us otherwise. 


Calculate the p-value using the normal distribution for a mean and 
using the sample mean as input (see the calculator instructions below 
for using the data as input): 


p-value = P( x > 286.2) = 0.1323. 


Interpretation of the p-value: If H, is true, then there is a 0.1331 
probability (13.23%) that the football players can lift a mean weight 
of 286.2 pounds or more. Because a 13.23% chance is large enough, a 
mean weight lift of 286.2 pounds or more is not a rare event. 


x= 286.2 p-value = 0.1323 
w= 275 


275 286.2 x 


Compare a and the p-value: 
w= 0025 p-value = 0.1323 
Make a decision: Since a<p-value, do not reject H,. 


Conclusion: At the 2.5% level of significance, from the sample data, 
there is not sufficient evidence to conclude that the true mean weight 
lifted is more than 275 pounds. 


The p-value can easily be calculated using the TI-83+ and the TI-84 
calculators: 


Put the data and frequencies into lists. Press STAT and arrow over to 
TESTS. Press 1:Z-Test. Arrow over to Data and press ENTER. 


Arrow down and enter 275 for 49, 55 for a, the name of the list where 
you put the data, and the name of the list where you put the 
frequencies. Arrow down to p : and arrow over to > [Up. Press 
ENTER. Arrow down to Calculate and press ENTER. The 
calculator not only calculates the p-value (p = 0.1331, a little 
different from the above calculation - in it we used the sample mean 
rounded to one decimal place instead of the data) but it also calculates 
the test statistic (z-score) for the sample mean, the sample mean, and 
the sample standard deviation. 4 > 275 is the alternate hypothesis. 
Do this set of instructions again except arrow to Dr aw (instead of 
Calculate). Press ENTER. A shaded graph appears with z = 1.112 
(test statistic) and p = 0.1331 (p-value). Make sure when you use 
Draw that no other equations are highlighted in Y = and the plots are 
turned off. 


Example: 
Exercise: 


Problem: 


Statistics students believe that the mean score on the first statistics test 
is 65. A statistics instructor thinks the mean score is higher than 65. 
He samples ten statistics students and obtains the scores 65 65 70 67 
66 63 63 68 72 71. He performs a hypothesis test using a 5% level of 
significance. The data are from a normal distribution. 


Solution: 
Set up the Hypothesis Test: 


A 5% level of significance means that a = 0.05. This is a test of a 
single population mean. 


Heo — op aie = 65 


Since the instructor thinks the average score is higher, use a ">". The 
">" means the test is right-tailed. 


Determine the distribution needed: 

Random variable: X = average score on the first statistics test. 
Distribution for the test: If you read the problem carefully, you will 
notice that there is no population standard deviation given. You are 
only given n = 10 sample data values. Notice also that the data come 
from a normal distribution. This means that the distribution for the 


test is a student's-t. 


Use tar. Therefore, the distribution for the test is tj) where n = 10 and 
df =10—-1=9. 


Calculate the p-value using the Student's-t distribution: 


p-value = P( x > 67 )= 0.0396 where the sample mean and sample 
standard deviation are calculated as 67 and 3.1972 from the data. 


Interpretation of the p-value: If the null hypothesis is true, then 


there is a 0.0396 probability (3.96%) that the sample mean is 67 or 
more. 


p-value = 0.0396 


65 67 * 


Compare q@ and the p-value: 


Since a = .05 and p-value = 0.0396. Therefore, a > p-value. 


Make a decision: Since a > p-value, reject Ho. 


This means you reject p42 = 65. In other words, you believe the 
average test score is more than 65. 


Conclusion: At a 5% level of significance, the sample data show 
sufficient evidence that the mean (average) test score is more than 65, 
just as the math instructor thinks. 


The p-value can easily be calculated using the TI-83+ and the TI-84 
calculators: 


Put the data into a list. Press STAT and arrow over to TESTS. Press 
2:T-Test. Arrow over to Data and press ENTER. Arrow down and 
enter 65 for jz, the name of the list where you put the data, and 1 for 
Freq:. Arrow down to p : and arrow over to > fo. Press ENTER. 
Arrow down to Calculate and press ENTER. The calculator not 
only calculates the p-value (p = 0.0396) but it also calculates the test 
Statistic (t-score) for the sample mean, the sample mean, and the 
sample standard deviation. jz > 65 is the alternate hypothesis. Do this 
set of instructions again except arrow to Dr aw (instead of 
Calculate). Press ENTER. A shaded graph appears with 

t = 1.9781 (test statistic) and p = 0.0396 (p-value). Make sure when 
you use Draw that no other equations are highlighted in Y = and the 
plots are turned off. 


Example: 
Exercise: 


Problem: 


Joon believes that 50% of first-time brides in the United States are 
younger than their grooms. She performs a hypothesis test to 
determine if the percentage is the same or different from 50%. Joon 
samples 100 first-time brides and 53 reply that they are younger than 
their grooms. For the hypothesis test, she uses a 1% level of 
significance. 


Solution: 
Set up the Hypothesis Test: 


The 1% level of significance means that a = 0.01. This is a test of a 
single population proportion. 


Hep ol) in ipra oe = 0p a9) 


The words "is the same or different from" tell you this is a two- 
tailed test. 


Calculate the distribution needed: 


Random variable: P/ = the percent of of first-time brides who are 
younger than their grooms. 


Distribution for the test: The problem contains no mention of a 


mean. The information is given in terms of percentages. Use the 
distribution for P/, the estimated proportion. 


Pi~N (», 4/ = Therefore, Pr ~ N (0, ae where 
p— 0: 50eg— bp 0 50 anda — 100; 
Calculate the p-value using the normal distribution for proportions: 


p-value = P(p-hat< 0.47 or p-hat > 0.53 ) = 0.5485 


where x = 53, p-hat = = = an = 53: 


Interpretation of the p-value: If the null hypothesis is true, there is 
0.5485 probability (54.85%) that the sample (estimated) proportion 
p-hat is 0.53 or more OR 0.47 or less (see the graph below). 


= (p-value) = 0.27425 = (p-value) = 0.27425 


0.47 0.50 0.53 


pt = p = 0.50 comes from Ho, the null hypothesis. 


p-hat= 0.53. Since the curve is symmetrical and the test is two- 
tailed, the p-hat for the left tail is equal to 0.50 — 0.03 = 0.47 where 
pL = p = 0.50. (0.03 is the difference between 0.53 and 0.50.) 


Compare q@ and the p-value: 
Since a = 0.01 and p-value = 0.5485. Therefore, a< p-value. 
Make a decision: Since a<p-value, you cannot reject Ho. 


Conclusion: At the 1% level of significance, the sample data do not 
show sufficient evidence that the percentage of first-time brides that 
are younger than their grooms is different from 50%. 


The p-value can easily be calculated using the TI-83+ and the TI-84 
calculators: 


Press STAT and arrow over to TESTS. Press 5:1-PropZTest. 
Enter .5 for po, 53 for x and 100 for n. Arrow down to Prop and 
arrow tonot equals po. Press ENTER. Arrow down to 


Calculate and press ENTER. The calculator calculates the p-value 
(p = 0.5485) and the test statistic (z-score). Prop not equals .5 
is the alternate hypothesis. Do this set of instructions again except 
arrow to Draw (instead of Calculate). Press ENTER. A shaded 
graph appears with z = 0.6 (test statistic) and p = 0.5485 (p-value). 
Make sure when you use Dr aw that no other equations are 
highlighted in Y = and the plots are turned off. 


The Type I and Type IJ errors are as follows: 


The Type I error is to conclude that the proportion of first-time brides 
that are younger than their grooms is different from 50% when, in 
fact, the proportion is actually 50%. (Reject the null hypothesis when 
the null hypothesis is true). 


The Type II error is there is not enough evidence to conclude that the 
proportion of first time brides that are younger than their grooms 
differs from 50% when, in fact, the proportion does differ from 50%. 
(Do not reject the null hypothesis when the null hypothesis is false.) 


Example: 
Exercise: 


Problem: 


Suppose a consumer group suspects that the proportion of households 
that have three cell phones is 30%. A cell phone company has reason 
to believe that the proportion is not 30%. Before they start a big 
advertising campaign, they conduct a hypothesis test. Their marketing 
people survey 150 households with the result that 43 of the 
households have three cell phones. 


Solution: 


Set up the Hypothesis Test: 


Hp ol) ipo) 
Determine the distribution needed: 


The random variable is P/ = proportion of households that have 
three cell phones. 


The distribution for the hypothesis test is P/ ~ N 


(0.30)-(0.70) 
(0.30, ee | 


Exercise: 


Problem: 


The value that helps determine the p-value is p-hat. Calculate 
p-hat. 


Solution: 


p-hat = = where x is the number of successes and 7 is the 
total number in the sample. 


i Aa ne 150 


_ 43 


p-hat i 


Exercise: 
Problem: What is a success for this problem? 


Solution: 


A success is having three cell phones in a household. 


Exercise: 


Problem: What is the level of significance? 


Solution: 


The level of significance is the preset a. Since a is not given, 
assume that a = 0.05. 


Draw the graph for this problem. Draw the horizontal axis. Label and 
shade appropriately. 
Exercise: 


Problem: Calculate the p-value. 


Solution: 


p-value = 0.7216 
Exercise: 


Problem: 


Make a decision. (Reject/Do not reject) Ho 
because 


Solution: 


Assuming that a = 0.05, a < p-value. The Decision is do not 
reject Ho because there is not sufficient evidence to conclude 
that the proportion of households that have three cell phones is 
not 30%. 


The next example is a poem written by a statistics student named Nicole 
Hart. The solution to the problem follows the poem. Notice that the 
hypothesis test is for a single population proportion. This means that the 
null and alternate hypotheses use the parameter p. The distribution for the 
test is normal. The estimated proportion p-hat is the proportion of fleas 
killed to the total fleas found on Fido. This is sample information. The 
problem gives a preconceived a = 0.01, for comparison, and a 95% 


confidence interval computation. The poem is clever and humorous, so 
please enjoy it! 


Note:Hypothesis testing problems consist of multiple steps. To help you do 
the problems, solution sheets are provided for your use. Look in the Table 
of Contents Appendix for the topic "Solution Sheets." If you like, use 
copies of the appropriate solution sheet for homework problems. 


Example: 
Exercise: 


Problem: 


My dog has so many fleas, They do not come off 
with ease. As for shampoo, I have tried many 
types Even one called Bubble Hype, Which only 
killed 25% of the fleas, Unfortunately I was 
not pleased. I've used all kinds of soap, Until 
I had give up hope Until one day I saw An ad 
that put me in awe. A shampoo used for dogs 
Called GOOD ENOUGH to Clean a Hog Guaranteed to 
Kill more fleas. I gave Fido a bath And after 
doing the math His number of fleas Started 
dropping by 3's! Before his shampoo I counted 
42. At the end of his bath, I redid the math 
And the new shampoo had killed 17 fleas. So now 
I was pleased. Now it is time for you to have 
some fun With the level of significance being 
.01, You must help me figure out Use the new 
shampoo or go without? 


Solution: 


Set up the Hypothesis Test: 


Jali NAS Hep = 020 
Determine the distribution needed: 


In words, CLEARLY state what your random variable X or P’ 
represents. 


P’ = The proportion of fleas that are killed by the new shampoo 


State the distribution to use for the test. 
. (0.25)(1—0.25) 
Normal: w (0.25, joa ) 


Test Statistic: z = 2.3163 
Calculate the p-value using the normal distribution for proportions: 
p-value =0.0103 


In 1 — 2 complete sentences, explain what the p-value means for this 
problem. 


If the null hypothesis is true (the proportion is 0.25), then there is a 
0.0103 probability that the sample (estimated) proportion is 0.4048 
(45) or more. 


Use the previous information to sketch a picture of this situation. 
CLEARLY, label and scale the horizontal axis and shade the region(s) 
corresponding to the p-value. 


p' 
0.25 17/42= _ test statistic for 
0.4048 17/42: 2.3163 


Compare q@ and the p-value: 


Indicate the correct decision (“reject” or “do not reject” the null 
hypothesis), the reason for it, and write an appropriate conclusion, 
using COMPLETE SENTENCES. 


alpha decision reason for decision 


0.01 Do not reject H, a<p-value 


Conclusion: At the 1% level of significance, the sample data do not 
show sufficient evidence that the percentage of fleas that are killed by 
the new shampoo is more than 25%. 


Construct a 95% Confidence Interval for the true mean or proportion. 
Include a sketch of the graph of the situation. Label the point estimate 
and the lower and upper bounds of the Confidence Interval. 


p' 


0.26 17/42 0.55 


Confidence Interval: (0.26, 0.55) We are 95% confident that the true 
population proportion p of fleas that are killed by the new shampoo is 
between 26% and 55%. 


Note: This test result is not very definitive since the p-value is very 
close to alpha. In reality, one would probably do more tests by giving 


the dog another bath after the fleas have had a chance to return. 


Glossary 


Central Limit Theorem 
Given a random variable (RV) with known mean p and known 
standard deviation o. We are sampling with size n and we are 
interested in two new RVs - the sample mean, X, and the sample sum, 
»/X. If the size n of the sample is sufficiently large, then X ~ 


N(n, =) and XX ~ N(nu, V/no). If the size n of the sample is 


sufficiently large, then the distribution of the sample means and the 
distribution of the sample sums will approximate a normal distribution 
regardless of the shape of the population. The mean of the sample 
means will equal the population mean and the mean of the sample 
sums will equal n times the population mean. The standard deviation 
of the distribution of the sample means, Tae is called the standard 


error of the mean. 


Standard Deviation 
A number that is equal to the square root of the variance and measures 
how far data values are from their mean. Notation: s for sample 
standard deviation and o for population standard deviation. 


12.11 Hypothesis Testing of Single Mean and Single Proportion: Summary 
of Formulas TCC 


A, and H, are contradictory. 


H, has: equal (=) 


then H, has: not equal (+) or greater than (>) or less than (<) 


If a < p-value, then do not reject Ho. 
If a > p-value, then reject H, . 


a is preconceived. Its value is set before the hypothesis test starts. The p- 
value is calculated from the data. 


qa = probability of a Type I error = P(Type I error) = probability of rejecting 
the null hypothesis when the null hypothesis is true. 


@ = probability of a Type II error = P(Type II error) = probability of not 
rejecting the null hypothesis when the null hypothesis is false. 


If there is no given preconceived a, then use a = 0.05. 
Types of Hypothesis Tests 


e Single population mean, known population variance (or standard 
deviation): Normal test. 

e Single population mean, unknown population variance (or standard 
deviation): Student's-t test. 

e Single population proportion: Normal test. 


12.12 Hypothesis Testing of Single Mean and Single Proportion: Practice 1 
This module provides a practice of Hypothesis Testing of Single Mean and 
Single Proportion as a part of Collaborative Statistics collection (col10522) 
by Barbara Illowsky and Susan Dean. 


Student Learning Outcomes 


e The student will conduct a hypothesis test of a single mean with 
known population standard deviation. 


Given 


Suppose that a recent article stated that the mean time spent in jail by a 
first-time convicted burglar is 2.5 years. A study was then done to see if the 
mean time has increased in the new century. A random sample of 26 first— 
time convicted burglars in a recent year was picked. The mean length of 
time in jail from the survey was 3 years with a standard deviation of 1.8 
years. Suppose that it is somehow known that the population standard 
deviation is 1.5. Conduct a hypothesis test to determine if the mean length 
of jail time has increased. The distribution of the population is normal. 


Hypothesis Testing: Single Mean 


Exercise: 


Problem: Is this a test of means or proportions? 


Solution: 
Means 
Exercise: 
Problem: State the null and alternative hypotheses. 


e aH: 


e b dZ.:: 


Solution: 


© aH, u (or, H, pu ) 
° bH, pu 


Exercise: 
Problem: 


Is this a right-tailed, left-tailed, or two-tailed test? How do you know? 


Solution: 
right-tailed 


Exercise: 


Problem: What symbol represents the Random Variable for this test? 


Solution: 
xX 


Exercise: 


Problem: In words, define the Random Variable for this test. 


Solution: 


The mean time spent in jail for 26 first time convicted burglars 
Exercise: 


Problem: 


Is the population standard deviation known and, if so, what is it? 


Solution: 
Yes, 1.5 
Exercise: 
Problem:Calculate the following: 
e az 
e bo 
° CS,y 
edn 
Solution: 
e a3 
e b1.5 


e c1.8 
e d26 


Exercise: 


Problem: 


Since both o and s, are given, which should be used? In 1 -2 complete 
sentences, explain why. 


Solution: 


oO 


Exercise: 


Problem: State the distribution to use for the hypothesis test. 


Solution: 


X N = 


Exercise: 
Problem: 
Sketch a graph of the situation. Label the horizontal axis. Mark the 


hypothesized mean and the sample mean x. Shade the area 
corresponding to the p-value. 


= 
x 


Exercise: 


Problem: Find the p-value. 
Solution: 
0.0446 
Exercise: 
Problem: At a pre-conceived @ , what is your: 


e a Decision: 
e b Reason for the decision: 
e c Conclusion (write out in a complete sentence): 


Solution: 


e aReject the null hypothesis 


Discussion Questions 


Exercise: 


Problem: 


Does it appear that the mean jail time spent for first time convicted 
burglars has increased? Why or why not? 


12.13 Hypothesis Testing of Single Mean and Single Proportion: Practice 2 
LEC 

This module provides a practice of Hypothesis Testing of Single Mean and 
Single Proportion as a part of Collaborative Statistics collection (col10522) 
by Barbara Illowsky and Susan Dean. 


Student Learning Outcomes 


e The student will conduct a hypothesis test of a single mean with 
unknown population standard deviation. 


Given 
A random survey of 75 death row inmates revealed that the mean length of 
time on death row is 17.4 years with a standard deviation of 6.3 years. 


Conduct a hypothesis test to determine if the population mean time on death 
row could likely be 15 years. 


Hypothesis Testing: Single Mean 


Exercise: 


Problem: Is this a test of means or proportions? 


Solution: 
averages 
Exercise: 
Problem: State the null and alternative hypotheses. 


e aH, 
e bH, 


Solution: 


© aH, u 
© bH, u 


Exercise: 
Problem: 
Is this a right-tailed, left-tailed, or two-tailed test? How do you know? 
Solution: 
two-tailed 

Exercise: 
Problem: What symbol represents the Random Variable for this test? 
Solution: 
X 

Exercise: 
Problem: In words, define the Random Variable for this test. 
Solution: 


the mean time spent on death row for the 75 inmates 
Exercise: 


Problem: 
Is the population standard deviation known and, if so, what is it? 


Solution: 


No 
Exercise: 

Problem: Calculate the following: 
° az 
eb 
ecn 

Solution: 
e al7.4 


e bs 
e c/5 


Exercise: 
Problem: 
Which test should be used? In 1 -2 complete sentences, explain why. 
Solution: 
t test 


Exercise: 


Problem: State the distribution to use for the hypothesis test. 


Solution: 


t 


Exercise: 


Problem: 
Sketch a graph of the situation. Label the horizontal axis. Mark the 


hypothesized mean and the sample mean, x. Shade the area 
corresponding to the p-value. 


== eee 
: 
Exercise: 


Problem: Find the p-value. 


Solution: 
0.0015 
Exercise: 

Problem: At a pre-conceived a , what is your: 
e a Decision: 
¢ b Reason for the decision: 
e c Conclusion (write out in a complete sentence): 

Solution: 


e aReject the null hypothesis 


Discussion Question 


Does it appear that the mean time on death row could be 15 years? Why or 
why not? 


12.14 Hypothesis Testing of Single Mean and Single Proportion: Practice 3 
TCC 

This module provides a practice of Hypothesis Testing of Single Mean and 
Single Proportion as a part of Collaborative Statistics collection (col10522) 
by Barbara Illowsky and Susan Dean. 


Student Learning Outcomes 


e The student will conduct a hypothesis test of a single population 
proportion. 


Given 


The National Institute of Mental Health published an article stating that in 
any one-year period, approximately 9.5 percent of American adults suffer 
from depression or a depressive illness. 
(http://www.nimh.nih.gov/publicat/depression.cfm) Suppose that in a 
survey of 100 people in a certain town, seven of them suffered from 
depression or a depressive illness. Conduct a hypothesis test to determine if 
the true proportion of people in that town suffering from depression or a 
depressive illness is lower than the percent in the general adult American 
population. 


Hypothesis Testing: Single Proportion 


Exercise: 


Problem: Is this a test of means or proportions? 


Solution: 


Proportions 


Exercise: 


Problem: State the null and alternative hypotheses. 


e 
ao g 


Solution: 


a 
eb 
Exercise: 


Problem: 


Is this a right-tailed, left-tailed, or two-tailed test? How do you know? 


Solution: 


left-tailed 


Exercise: 
Problem: What symbol represents the Random Variable for this test? 


Solution: 


Exercise: 
Problem: In words, define the Random Variable for this test. 


Solution: 


the proportion of people in that town surveyed suffering from 
depression or a depressive illness 


Exercise: 


Problem:Calculate the following: 


e 
a & & 


Solution: 


e a/ 
e b100 
e c0.07 


Exercise: 


Problem: 


Calculate — . Make sure to show how you set up the formula. 


Solution: 


0.0293 


Exercise: 


Problem: State the distribution to use for the hypothesis test. 


Solution: 


Normal 
Exercise: 
Problem: 
Sketch a graph of the situation. Label the horizontal axis. Mark the 


hypothesized mean and the sample proportion, p-hat. Shade the area 
corresponding to the p-value. 


P' 
Exercise: 


Problem: Find the p-value 
Solution: 
0.1969 
Exercise: 
Problem: At a pre-conceived , what is your: 


e a Decision: 
¢ b Reason for the decision: 
¢ c Conclusion (write out in a complete sentence): 


Solution: 


e aDo not reject the null hypothesis 


Discusion Question 


Exercise: 
Problem: 
Does it appear that the proportion of people in that town with 


depression or a depressive illness is lower than general adult American 
population? Why or why not? 


12.15 Hypothesis Testing of Single Mean and Single Proportion: 
Homework TCC 

This module provides a homework of Hypothesis Testing of Single Mean 
and Single Proportion as a part of Collaborative Statistics collection 
(col10522) by Barbara Illowsky and Susan Dean. 

Exercise: 


Problem: 


Some of the statements below refer to the null hypothesis, some to the 
alternate hypothesis. 


State the null hypothesis, H,, and the alternative hypothesis, H, in 
terms of the appropriate parameter (jz or p). 


e aThe mean number of years Americans work before retiring is 34. 

e bAt most 60% of Americans vote in presidential elections. 

e cThe mean starting salary for San Jose State University graduates 
is at least $100,000 per year. 

e d29% of high school seniors get drunk each month. 

e eFewer than 5% of adults ride the bus to work in Los Angeles. 

e fThe mean number of cars a person owns in her lifetime is not 
more than 10. 

e gAbout half of Americans prefer to live away from cities, given 
the choice. 

e hEuropeans have a mean paid vacation each year of six weeks. 

e iThe chance of developing breast cancer is under 11% for women. 

e jPrivate universities mean tuition cost is more than $20,000 per 
year. 


Solution: 


© aH: = 34; Hy: F 34 

¢ c H,:u > 100,000 ; H,:u < 100,000 
« @€H,-p= 0.29; Hp 40.29 

‘eH p= 0-503 A, p+ -050 

¢ i Ho:p > 0.11; Ha:p < 0.11 


Exercise: 


Problem: 


For (a) - (j) above, state the Type I and Type II errors in complete 
sentences. 


Solution: 


e alype I error: We conclude that the mean is not 34 years, when it 
really is 34 years. Type II error: We do not conclude that the mean 
is not 34 years, when it is not really 34 years. 

¢ clype I error: We conclude that the mean is less than $100,000, 
when it really is at least $100,000. Type II error: We do not 
conclude that the mean is less than $100,000, when it is really 
less than $100,000. 

e dType I error: We conclude that the proportion of h.s. seniors who 
get drunk each month is not 29%, when it really is 29%. Type II 
error: We do not conclude that the proportion of h.s. seniors that 
get drunk each month is not 29%, when it is really not 29%. 

e iType I error: We conclude that the proportion is less than 11%, 
when it is really at least 11%. Type II error: We do not conclude 
that the proportion is less than 11%, when it really is less than 
11%. 


Exercise: 


Problem: For (a) - (j) above, in complete sentences: 


e aState a consequence of committing a Type I error. 
e bState a consequence of committing a Type II error. 


Note:For each of the word problems, use a solution sheet to do the 
hypothesis test. The solution sheet is found in 14. Appendix (online book 


version: the link is "Solution Sheets"; PDF book version: look under 14.5 
Solution Sheets). Please feel free to make copies of the solution sheets. For 
the online version of the book, it is suggested that you copy the .doc or the 
.pdf files. 


Note:If you are using a student's-t distribution for a homework problem 
below, you may assume that the underlying population is normally 
distributed. (In general, you must first prove that assumption, though.) 


Exercise: 


Problem: 


A particular brand of tires claims that its deluxe tire averages at least 
50,000 miles before it needs to be replaced. From past studies of this 
tire, the standard deviation is known to be 8000. A survey of owners of 
that tire design is conducted. From the 28 tires surveyed, the mean 
lifespan was 46,500 miles with a standard deviation of 9800 miles. Do 
the data support the claim at the 5% level? 


Exercise: 


Problem: 


From generation to generation, the mean age when smokers first start 
to smoke varies. However, the standard deviation of that age remains 
constant of around 2.1 years. A survey of 40 smokers of this 
generation was done to see if the mean starting age is at least 19. The 
sample mean was 18.1 with a sample standard deviation of 1.3. Do the 
data support the claim at the 5% level? 


Solution: 


eez— —2.71 
e £0.0034 
e hDecision: Reject null; Conclusion: uw < 19 


e i (17.449,18.757) 


Exercise: 


Problem: 


The cost of a daily newspaper varies from city to city. However, the 
variation among prices remains steady with a standard deviation of 
20¢. A study was done to test the claim that the mean cost of a daily 
newspaper is $1.00. Twelve costs yield a mean cost of 95¢ with a 
standard deviation of 18¢. Do the data support the claim at the 1% 
level? 


Exercise: 


Problem: 


An article in the San Jose Mercury News stated that students in the 
California state university system take 4.5 years, on average, to finish 
their undergraduate degrees. Suppose you believe that the mean time is 
longer. You conduct a survey of 49 students and obtain a sample mean 
of 5.1 with a sample standard deviation of 1.2. Do the data support 
your claim at the 1% level? 


Solution: 


° e3.5 

¢ f0.0005 

e hDecision: Reject null; Conclusion: > 4.5 
e i (4.7553,5.4447) 


Exercise: 


Problem: 


The mean number of sick days an employee takes per year is believed 
to be about 10. Members of a personnel department do not believe this 
figure. They randomly survey 8 employees. The number of sick days 
they took for the past year are as follows: 12; 4; 15; 3; 11; 8; 6; 8. Let 
x = the number of sick days they took for the past year. Should the 
personnel team believe that the mean number is about 10? 


Exercise: 


Problem: 


In 1955, Life Magazine reported that the 25 year-old mother of three 
worked, on average, an 80 hour week. Recently, many groups have 
been studying whether or not the women's movement has, in fact, 
resulted in an increase in the average work week for women 
(combining employment and at-home work). Suppose a study was 
done to determine if the mean work week has increased. 81 women 
were surveyed with the following results. The sample mean was 83; 
the sample standard deviation was 10. Does it appear that the mean 
work week has increased for women at the 5% level? 


Solution: 


e e2.7 

e £0.0042 

e hDecision: Reject Null 
e i (80.789,85.211) 


Exercise: 


Problem: 


Your statistics instructor claims that 60 percent of the students who 
take her Elementary Statistics class go through life feeling more 
enriched. For some reason that she can't quite figure out, most people 
don't believe her. You decide to check this out on your own. You 
randomly survey 64 of her past Elementary Statistics students and find 
that 34 feel more enriched as a result of her class. Now, what do you 
think? 


Exercise: 


Problem: 


A Nissan Motor Corporation advertisement read, “The average man’s 
I.Q. is 107. The average brown trout’s I.Q. is 4. So why can’t man 
catch brown trout?” Suppose you believe that the brown trout’s mean 
I.Q. is greater than 4. You catch 12 brown trout. A fish psychologist 
determines the I.Q.s as follows: 5; 4; 7; 3; 6; 4; 5; 3; 6; 3; 8; 5. 
Conduct a hypothesis test of your belief. 


Solution: 


e dti 

e e1.96 

e £0.0380 

e hDecision: Reject null when a = 0.05 ; do not reject null when 
a = 0.01 

e i (3.8865,5.9468) 


Exercise: 
Problem: 
Refer to the previous problem. Conduct a hypothesis test to see if your 


decision and conclusion would change if your belief were that the 
brown trout’s mean I.Q. is not 4. 


Exercise: 


Problem: 


According to an article in Newsweek, the natural ratio of girls to boys 
is 100:105. In China, the birth ratio is 100: 114 (46.7% girls). Suppose 
you don’t believe the reported figures of the percent of girls born in 
China. You conduct a study. In this study, you count the number of 
girls and boys born in 150 randomly chosen recent births. There are 60 
girls and 90 boys born of the 150. Based on your study, do you believe 
that the percent of girls born in China is 46.7? 


Solution: 


e e-1.64 
e £0.1000 


e hDecision: Do not reject null 
e i (0.3216,0.4784) 


Exercise: 


Problem: 


A poll done for Newsweek found that 13% of Americans have seen or 
sensed the presence of an angel. A contingent doubts that the percent is 
really that high. It conducts its own survey. Out of 76 Americans 
surveyed, only 2 had seen or sensed the presence of an angel. As a 
result of the contingent’s survey, would you agree with the Newsweek 
poll? In complete sentences, also give three reasons why the two polls 
might give different results. 


Exercise: 


Problem: 


The mean work week for engineers in a start-up company is believed 
to be about 60 hours. A newly hired engineer hopes that it’s shorter. 
She asks 10 engineering friends in start-ups for the lengths of their 
mean work weeks. Based on the results that follow, should she count 
on the mean work week to be shorter than 60 hours? 


Data (length of mean work week): 70; 45; 55; 60; 65; 55; 55; 60; 50; 
aoe 


Solution: 


id dto 

e e-1.33 

e £0.1086 

e hDecision: Do not reject null 
e i (51.886,62.114) 


Exercise: 
Problem: 
Use the “Lap time” data for Lap 4 (see Table of Contents) to test the 


claim that Terri finishes Lap 4, on average, in less than 129 seconds. 
Use all twenty races given. 


Exercise: 
Problem: 
Use the “Initial Public Offering” data (see Table of Contents) to test 
the claim that the mean offer price was $18 per share. Do not use all 


the data. Use your random number generator to randomly survey 15 
prices. 


Note:The following questions were written by past students. They are 
excellent problems! 


Exercise: 


Problem:18. "Asian Family Reunion" by Chau Nguyen 


Every two years it comes around 

We all get together from different towns. 
In my honest opinion 

It's not a typical family reunion 

Not forty, or fifty, or sixty, 

But how about seventy companions! 

The kids would play, scream, and shout 
One minute they're happy, another they'll pout. 
The teenagers would look, stare, and compare 
From how they look to what they wear. 

The men would chat about their business 

That they make more, but never less. 

Money is always their subject 

And there's always talk of more new projects. 
The women get tired from all of the chats 
They head to the kitchen to set out the mats. 
Some would sit and some would stand 

Eating and talking with plates in their hands. 
Then come the games and the songs 

And suddenly, everyone gets along! 

With all that laughter, it's sad to say 

That it always ends in the same old way. 

They hug and kiss and say "good-bye" 

And then they all begin to cry! 

I say that 60 percent shed their tears 

But my mom counted 35 people this year. 

She said that boys and men will always have 
their pride, 

So we won't ever see them cry. 

I myself don't think she's correct, 

So could you please try this problem to see if 
you object? 


Exercise: 


Problem:"The Problem with Angels" by Cyndy Dowling 


Although this problem is wholly mine, 

The catalyst came from the magazine, Time. 
On the magazine cover I did find 

The realm of angels tickling my mind. 


Inside, 69% I found to be 
In angels, Americans do believe. 


Then, it was time to rise to the task, 
Ninety-five high school and college students I 
did ask. 

Viewing all as one group, 

Random sampling to get the scoop. 


So, I asked each to be true, 
"Do you believe in angels?" Tell me, do! 


Hypothesizing at the start, 
Totally believing in my heart 
That the proportion who said yes 
Would be equal on this test. 


Lo and behold, seventy-three did arrive, 
Out of the sample of ninety-five. 

Now your job has just begun, 

Solve this problem and have some fun. 


Solution: 


e e1.65 

e £0.0984 

e hDecision: Do not reject null 
e i (0.6836,0.8533) 


Exercise: 


Problem:"Blowing Bubbles" by Sondra Prull 


Studying stats just made me tense, 
I had to find some sane defense. 
Some light and lifting simple play 
To float my math anxiety away. 


Blowing bubbles lifts me high 

Takes my troubles to the sky. 

POIK! They're gone, with all my stress 
Bubble therapy is the best. 


The label said each time I blew 

The average number of bubbles would be at least 
22. 

I blew and blew and this I found 

From 64 blows, they all are round! 


But the number of bubbles in 64 blows 
Varied widely, this I know. 

20 per blow became the mean 

They deviated by 6, and not 16. 


From counting bubbles, I sure did relax 
But now I give to you your task. 

Was 22 a reasonable guess? 

Find the answer and pass this test! 


Exercise: 


Problem:21. "Dalmatian Damation" by Kathy Sparling 


A greedy dog breeder named Spreckles 
Bred puppies with numerous freckles 
The Dalmatians he sought 


Possessed spot upon spot 
The more spots, he thought, the more shekels. 


His competitors did not agree 

That freckles would increase the fee. 
They said, “Spots are quite nice 

But they don't affect price; 

One should breed for improved pedigree.” 


The breeders decided to prove 
This strategy was a wrong move. 
Breeding only for spots 

Would wreak havoc, they thought. 
His theory they want to disprove. 


They proposed a contest to Spreckles 
Comparing dog prices to freckles. 

In records they looked up 

One hundred one pups: 

Dalmatians that fetched the most shekels. 


They asked Mr. Spreckles to name 

An average spot count he'd claim 

To bring in big bucks. 

Said Spreckles, “Well, shucks, 

It's for one hundred one that I aim.” 


Said an amateur statistician 

Who wanted to help with this mission. 
“Twenty-one for the sample 

Standard deviation's ample: 


They examined one hundred and one 
Dalmatians that fetched a good sum. 
They counted each spot, 

Mark, freckle and dot 

And tallied up every one. 


Instead of one hundred one spots 

They averaged ninety six dots 

Can they muzzle Spreckles’ 

Obsession with freckles 

Based on all the dog data they've got? 


Solution: 


e e-2.39 

e £0.0093 

e hDecision: Reject null 
¢ i (91.854,100.15) 


Exercise: 


Problem: 


"Macaroni and Cheese, please!!" by Nedda Misherghi and Rachelle 
Hall 


As a poor starving student I don't have much money to spend for even 
the bare necessities. So my favorite and main staple food is macaroni 
and cheese. It's high in taste and low in cost and nutritional value. 


One day, as I sat down to determine the meaning of life, I got a serious 
craving for this, oh, so important, food of my life. So I went down the 
street to Greatway to get a box of macaroni and cheese, but it was SO 
expensive! $2.02 !!! Can you believe it? It made me stop and think. 
The world is changing fast. I had thought that the mean cost of a box 
(the normal size, not some super-gigantic-family-value-pack) was at 
most $1, but now I wasn't so sure. However, I was determined to find 
out. I went to 53 of the closest grocery stores and surveyed the prices 
of macaroni and cheese. Here are the data I wrote in my notebook: 
Price per box of Mac and Cheese: 


e 5 stores @ $2.02 
e 15 stores @ $0.25 


e 3stores @ $1.29 
e 6 stores @ $0.35 
e Astores @ $2.27 
e 7 stores @ $1.50 
e 5 stores @ $1.89 
e 8 stores @ 0.75. 


I could see that the costs varied but I had to sit down to figure out 
whether or not I was right. If it does turn out that this mouth-watering 
dish is at most $1, then I'll throw a big cheesy party in our next 
statistics lab, with enough macaroni and cheese for just me. (After all, 
as a poor starving student I can't be expected to feed our class of 
animals!) 


Exercise: 
Problem: 
"William Shakespeare: The Tragedy of Hamlet, Prince of Denmark" 


by Jacqueline Ghodsi 
THE CHARACTERS (in order of appearance): 


¢ HAMLET, Prince of Denmark and student of Statistics 
e POLONIUS, Hamlet’s tutor 
e HOROTIO, friend to Hamlet and fellow student 


Scene: The great library of the castle, in which Hamlet does his lessons 
Act I 


(The day is fair, but the face of Hamlet is clouded. He paces the large 
room. His tutor, Polonius, is reprimanding Hamlet regarding the 
latter’s recent experience. Horatio is seated at the large table at right 
stage.) 


POLONIUS: My Lord, how cans’t thou admit that thou hast seen a 
ghost! It is but a figment of your imagination! 


HAMLET: I beg to differ; I know of a certainty that five-and-seventy 
in one hundred of us, condemned to the whips and scorns of time as 
we are, have gazed upon a spirit of health, or goblin damn’d, be their 
intents wicked or charitable. 


POLONIUS If thou doest insist upon thy wretched vision then let me 
invest your time; be true to thy work and speak to me through the 
reason of the null and alternate hypotheses. (He turns to Horatio.) Did 
not Hamlet himself say, “What piece of work is man, how noble in 
reason, how infinite in faculties? Then let not this foolishness persist. 
Go, Horatio, make a survey of three-and-sixty and discover what the 
true proportion be. For my part, I will never succumb to this fantasy, 
but deem man to be devoid of all reason should thy proposal of at least 
five-and-seventy in one hundred hold true. 


HORATIO (to Hamlet): What should we do, my Lord? 
HAMLET: Go to thy purpose, Horatio. 
HORATIO: To what end, my Lord? 


HAMLET: That you must teach me. But let me conjure you by the 
rights of our fellowship, by the consonance of our youth, but the 
obligation of our ever-preserved love, be even and direct with me, 
whether I am right or no. 


(Horatio exits, followed by Polonius, leaving Hamlet to ponder alone.) 
Act Il 


(The next day, Hamlet awaits anxiously the presence of his friend, 
Horatio. Polonius enters and places some books upon the table just a 
moment before Horatio enters.) 


POLONIUS: So, Horatio, what is it thou didst reveal through thy 
deliberations? 


HORATIO: In a random survey, for which purpose thou thyself sent 
me forth, I did discover that one-and-forty believe fervently that the 
spirits of the dead walk with us. Before my God, I might not this 
believe, without the sensible and true avouch of mine own eyes. 


POLONIUS: Give thine own thoughts no tongue, Horatio. (Polonius 
turns to Hamlet.) But look to’t I charge you, my Lord. Come Horatio, 
let us go together, for this is not our test. (Horatio and Polonius leave 
together.) 


HAMLET: To reject, or not reject, that is the question: whether ‘tis 
nobler in the mind to suffer the slings and arrows of outrageous 
Statistics, or to take arms against a sea of data, and, by opposing, end 
them. (Hamlet resignedly attends to his task.) 


(Curtain falls) 


Solution: 


e e-1.82 
e (0.0345 


e hDecision: Do not reject null 
e i (0.5331,0.7685) 


Exercise: 


Problem:"Untitled" by Stephen Chen 


I've often wondered how software is released and sold to the public. 
Ironically, I work for a company that sells products with known 
problems. Unfortunately, most of the problems are difficult to create, 
which makes them difficult to fix. I usually use the test program X, 
which tests the product, to try to create a specific problem. When the 
test program is run to make an error occur, the likelihood of generating 
an error is 1%. 


So, armed with this knowledge, I wrote a new test program Y that will 
generate the same error that test program X creates, but more often. To 
find out if my test program is better than the original, so that I can 
convince the management that I'm right, I ran my test program to find 
out how often I can generate the same error. When I ran my test 
program 50 times, I generated the error twice. While this may not 
seem much better, I think that I can convince the management to use 
my test program instead of the original test program. Am I right? 


Exercise: 


Problem: Japanese Girls’ Names 
by Kumi Furuichi 


It used to be very typical for Japanese girls’ names to end with “ko.” 
(The trend might have started around my grandmothers’ generation 
and its peak might have been around my mother’s generation.) “Ko” 
means “child” in Chinese character. Parents would name their 
daughters with “ko” attaching to other Chinese characters which have 
meanings that they want their daughters to become, such as Sachiko — 
a happy child, Yoshiko — a good child, Yasuko — a healthy child, and 
sO on. 


However, I noticed recently that only two out of nine of my Japanese 
girlfriends at this school have names which end with “ko.” More and 
more, parents seem to have become creative, modernized, and, 
sometimes, westernized in naming their children. 


I have a feeling that, while 70 percent or more of my mother’s 
generation would have names with “ko” at the end, the proportion has 
dropped among my peers. I wrote down all my Japanese friends’, ex- 
classmates’, co-workers, and acquaintances’ names that I could 
remember. Below are the names. (Some are repeats.) Test to see if the 
proportion has dropped for this generation. 


Ai, Akemi, Akiko, Ayumi, Chiaki, Chie, Eiko, Eri, Eriko, Fumiko, 
Harumi, Hitomi, Hiroko, Hiroko, Hidemi, Hisako, Hinako, Izumi, 


Izumi, Junko, Junko, Kana, Kanako, Kanayo, Kayo, Kayoko, Kazumi, 
Keiko, Keiko, Kei, Kumi, Kumiko, Kyoko, Kyoko, Madoka, Maho, 
Mai, Maiko, Maki, Miki, Miki, Mikiko, Mina, Minako, Miyako, 
Momoko, Nana, Naoko, Naoko, Naoko, Noriko, Rieko, Rika, Rika, 
Rumiko, Rei, Reiko, Reiko, Sachiko, Sachiko, Sachiyo, Saki, Sayaka, 
Sayoko, Sayuri, Seiko, Shiho, Shizuka, Sumiko, Takako, Takako, 
Tomoe, Tomoe, Tomoko, Touko, Yasuko, Yasuko, Yasuyo, Yoko, 
Yoko, Yoko, Yoshiko, Yoshiko, Yoshiko, Yuka, Yuki, Yuki, Yukiko, 
Yuko, Yuko. 


Solution: 


ez = —2.99 

f0.0014 

hDecision: Reject null; Conclusion: p < .70 
i (0.4529,0.6582) 


Exercise: 


Problem: Phillip’s Wish by Suzanne Osorio 


My nephew likes to play 

Chasing the girls makes his day. 

He asked his mother 

If it is okay 

To get his ear pierced. 

She said, “No way!” 

To poke a hole through your ear, 

Is not what I want for you, dear. 

He argued his point quite well, 

Says even my macho pal, Mel, 

Has gotten this done. 

It’s all just for fun. 

C’mon please, mom, please, what the hell. 
Again Phillip complained to his mother, 
Saying half his friends (including their 
brothers) 


Are piercing their ears 

And they have no fears 

He wants to be like the others. 

She said, “I think it’s much less. 

We must do a hypothesis test. 

And if you are right, 

I won’t put up a fight. 

But, if not, then my case will rest.” 
We proceeded to call fifty guys 

To see whose prediction would fly. 
Nineteen of the fifty 

Said piercing was nifty 

And earrings they’d occasionally buy. 
Then there’s the other thirty-one, 
Who said they’d never have this done. 
So now this poem’s finished. 

Will his hopes be diminished, 

Or will my nephew have his fun? 


Exercise: 


Problem: The Craven by Mark Salangsang 


Once upon a morning dreary 

In stats class I was weak and weary. 
Pondering over last night’s homework 
Whose answers were now on the board 

This I did and nothing more. 


While I nodded nearly napping 
Suddenly, there came a tapping. 

AS someone gently rapping, 

Rapping my head as I snore. 

Quoth the teacher, “Sleep no more.” 


“In every class you fall asleep,” 
The teacher said, his voice was deep. 


“So a tally I’ve begun to keep 
Of every class you nap and snore. 
The percentage being forty-four.” 


“My dear teacher I must confess, 

While sleeping is what I do best. 

The percentage, I think, must be less, 
A percentage less than forty-four.” 
This I said and nothing more. 


“We'll see,” he said and walked away, 
And fifty classes from that day 
He counted till the month of May 
The classes in which I napped and snored. 
The number he found was twenty-four. 


At a Significance level of 0.05, 
Please tell me am I still alive? 
Or did my grade just take a dive 
Plunging down beneath the floor? 
Upon thee I hereby implore. 


Solution: 


e e0.57 
e f0.7156 


e hDecision: Do not reject null 
e i (0.3415,0.6185) 


Exercise: 


Problem: 


Toastmasters International cites a report by Gallop Poll that 40% of 
Americans fear public speaking. A student believes that less than 40% 
of students at her school fear public speaking. She randomly surveys 
361 schoolmates and finds that 135 report they fear public speaking. 
Conduct a hypothesis test to determine if the percent at her school is 
less than 40%. (Source: http:/Atoastmasters.org/artisan/detail.asp? 


Exercise: 


Problem: 


68% of online courses taught at community colleges nationwide were 
taught by full-time faculty. To test if 68% also represents California’s 
percent for full-time faculty teaching the online classes, Long Beach 
City College (LBCC), CA, was randomly selected for comparison. In 
the same year, 34 of the 44 online courses LBCC offered were taught 
by full-time faculty. Conduct a hypothesis test to determine if 68% 
represents CA. NOTE: For more accurate results, use more CA 
community colleges and this past year's data. (Sources: Growing by 
Degrees by Allen and Seaman; Amit Schitai, Director of Instructional 
Technology and Distance Learning, LBCC). 


Solution: 


e e132 

e £0.1873 

e hDecision: Do not reject null 
e i (0.65,0.90) 


Exercise: 


Problem: 


According to an article in Bloomberg Businessweek, New York City's 
most recent adult smoking rate is 14%. Suppose that a survey is 
conducted to determine this year’s rate. Nine out of 70 randomly 
chosen N.Y. City residents reply that they smoke. Conduct a 
hypothesis test to determine if the rate is still 14% or if it has 
decreased. (Source: http:/,www.businessweek.com/news/2011-09- 
15/nyc-smoking-rate-falls-to-record-low-of-14-bloomberg-says.html) 


Exercise: 


Problem: 


The mean age of De Anza College students in a previous term was 26.6 
years old. An instructor thinks the mean age for online students is older 
than 26.6. She randomly surveys 56 online students and finds that the 
sample mean is 29.4 with a standard deviation of 2.1. Conduct a 
hypothesis test. (Source: 


Solution: 


e e9.98 

¢ f0.0000 

e hDecision: Reject null 
e i (28.8,30.0) 


Exercise: 


Problem: 


Registered nurses earned an average annual salary of $69,110. For that 
same year, a survey was conducted of 41 California registered nurses 
to determine if the annual salary is higher than $69,110 for California 
nurses. The sample average was $71,121 with a sample standard 
deviation of $7,489. Conduct a hypothesis test. (Source: 
http://www.bls.gov/oes/current/oes291111.htm) 


Exercise: 


Problem: 


La Leche League International reports that the mean age of weaning a 
child from breastfeeding is age 4 to 5 worldwide. In America, most 
nursing mothers wean their children much earlier. Suppose a random 
survey is conducted of 21 U.S. mothers who recently weaned their 
children. The mean weaning age was 9 months (3/4 year) with a 
standard deviation of 4 months. Conduct a hypothesis test to determine 
if the mean weaning age in the U.S. is less than 4 years old. (Source: 
http://www. lalecheleague.org/Law/BAFeb01.html) 


Solution: 


e e-44.7 

e f0.0000 

e hDecision: Reject null 
e i (0.60,0.90) - in years 


Try these multiple choice questions. 


Exercise: 


Problem: 


When a new drug is created, the pharmaceutical company must subject 
it to testing before receiving the necessary permission from the Food 
and Drug Administration (FDA) to market the drug. Suppose the null 
hypothesis is “the drug is unsafe.” What is the Type II Error? 


e ATo conclude the drug is safe when in, fact, it is unsafe 

¢ BTo not conclude the drug is safe when, in fact, it is safe. 

¢ CTo conclude the drug is safe when, in fact, it is safe. 

¢ DTo not conclude the drug is unsafe when, in fact, it is unsafe 


Solution: 


B 


The next two questions refer to the following information: Over the past 
few decades, public health officials have examined the link between weight 
concerns and teen girls smoking. Researchers surveyed a group of 273 
randomly selected teen girls living in Massachusetts (between 12 and 15 
years old). After four years the girls were surveyed again. Sixty-three (63) 
said they smoked to stay thin. Is there good evidence that more than thirty 
percent of the teen girls smoke to stay thin? 

Exercise: 


Problem:The alternate hypothesis is 


¢ Ap < 0.30 
© Bp < 0.30 
© Cp> 0.30 
¢ Dp> 0.30 


Solution: 


D 


Exercise: 


Problem: After conducting the test, your decision and conclusion are 


e AReject H,: There is sufficient evidence to conclude that more 
than 30% of teen girls smoke to stay thin. 

¢ BDo not reject H,: There is not sufficient evidence to conclude 
that less than 30% of teen girls smoke to stay thin. 

¢ CDo not reject H,: There is not sufficient evidence to conclude 
that more than 30% of teen girls smoke to stay thin. 

e DReject H,: There is sufficient evidence to conclude that less 
than 30% of teen girls smoke to stay thin. 


Solution: 


C 


The next three questions refer to the following information: A statistics 
instructor believes that fewer than 20% of Evergreen Valley College (EVC) 
students attended the opening night midnight showing of the latest Harry 
Potter movie. She surveys 84 of her students and finds that 11 of attended 
the midnight showing. 

Exercise: 


Problem: An appropriate alternative hypothesis is 


¢ Ap= 0.20 
¢ Bp > 0.20 
¢e Cp< 0.20 
¢ Dp < 0.20 


Solution: 


C 


Exercise: 


Problem: At a 1% level of significance, an appropriate conclusion is: 


e AThere is insufficient evidence to conclude that the percent of 
EVC students that attended the midnight showing of Harry Potter 
is less than 20%. 

e BThere is sufficient evidence to conclude that the percent of EVC 
students that attended the midnight showing of Harry Potter is 
more than 20%. 

e CThere is sufficient evidence to conclude that the percent of EVC 
students that attended the midnight showing of Harry Potter is 
less than 20%. 


¢ DThere is insufficient evidence to conclude that the percent of 
EVC students that attended the midnight showing of Harry Potter 
is at least 20%. 


Solution: 


A 
Exercise: 


Problem: 


The Type I error is to conclude that the percent of EVC students who 
attended is 


e Aat least 20%, when in fact, it is less than 20%. 
¢ B20%, when in fact, it is 20%. 

e Cless than 20%, when in fact, it is at least 20%. 

e Dless than 20%, when in fact, it is less than 20%. 


Solution: 


C 


The next two questions refer to the following information: 


It is believed that Lake Tahoe Community College (LTCC) Intermediate 
Algebra students get less than 7 hours of sleep per night, on average. A 
survey of 22 LTCC Intermediate Algebra students generated a mean of 7.24 
hours with a standard deviation of 1.93 hours. At a level of significance of 
5%, do LTCC Intermediate Algebra students get less than 7 hours of sleep 
per night, on average? 

Exercise: 


Problem:The distribution to be used for this test is _X ~ 


¢ A N(7.24,+22) 


? ./22 
¢ B N(7.24,1.93) 
e Cit 
¢ Dt2 
Solution: 
D 
Exercise: 
Problem: 


The Type II error is to not reject that the mean number of hours of 
sleep LTCC students get per night is at least 7 when, in fact, the mean 
number of hours 


e Ais more than 7 hours. 
e Bis at most 7 hours. 

e Cis at least 7 hours. 

e Dis less than 7 hours. 


Solution: 


D 


The next three questions refer to the following information: Previously, 
an organization reported that teenagers spent 4.5 hours per week, on 
average, on the phone. The organization thinks that, currently, the mean is 
higher. Fifteen (15) randomly chosen teenagers were asked how many hours 
per week they spend on the phone. The sample mean was 4.75 hours with a 
sample standard deviation of 2.0. Conduct a hypothesis test. 

Exercise: 


Problem:The null and alternate hypotheses are: 


AH,:x = 4.5, Haix > 4.5 
B Ho: > 4.5 Hai < 4.5 
C H,: = 4.75 Hyp > 4.75 
D How = 4.5 Hou > 4.5 


Solution: 


D 


Exercise: 


Problem: 


At a significance level of a = 0.05, what is the correct conclusion? 


AThere is enough evidence to conclude that the mean number of 
hours is more than 4.75 

BThere is enough evidence to conclude that the mean number of 
hours is more than 4.5 

CThere is not enough evidence to conclude that the mean number 
of hours is more than 4.5 

DThere is not enough evidence to conclude that the mean number 
of hours is more than 4.75 


Solution: 


C 


Exercise: 


Problem:The Type I error is: 


ATo conclude that the current mean hours per week is higher than 
4.5, when in fact, it is higher. 

BTo conclude that the current mean hours per week is higher than 
4.5, when in fact, it is the same. 


¢ CTo conclude that the mean hours per week currently is 4.5, 
when in fact, it is higher. 

¢ DTo conclude that the mean hours per week currently is no higher 
than 4.5, when in fact, it is not higher. 


Solution: 


B 


12.16 Hypothesis Testing of Single Mean and Single Proportion: Review 
TCC 

This module provides an overview of Hypothesis Testing of Single Mean 
and Single Proportion as a part of Collaborative Statistics collection 
(col10522) by Barbara Illowsky and Susan Dean. 

Exercise: 


Problem: 


Rebecca and Matt are 14 year old twins. Matt’s height is 2 standard 
deviations below the mean for 14 year old boys’ height. Rebecca’s 
height is 0.10 standard deviations above the mean for 14 year old girls 
height. Interpret this. 


’ 


e AMatt is 2.1 inches shorter than Rebecca 

e BRebecca is very tall compared to other 14 year old girls. 
e CRebecca is taller than Matt. 

e DMatt is shorter than the average 14 year old boy. 


Solution: 


D 
Exercise: 


Problem: 


Construct a histogram of the IPO data (see Table of Contents, 14. 
Appendix, Data Sets). Use 5 intervals. 


Solution: 


No solution provided. There are several ways in which the histogram 
could be constructed. 


The next three exercises refer to the following information: Ninety 
homeowners were asked the number of estimates they obtained before 
having their homes fumigated. =the number of estimates. 


Rel. Freq. Cumulative Rel. Freq. 


1 0.3 
2 0.2 
4 0.4 
rs) 0.1 


Complete the cumulative relative frequency column. 
Exercise: 


Problem: 


Calculate the sample mean (a), the sample standard deviation (b) and 
the percent of the estimates that fall at or below 4 (c). 


Solution: 
e a2.8 


e b 1.48 
e c90% 


Exercise: 


Problem: 


Calculate the median, M, the first quartile, Q1, the third quartile, Q3. 
Then construct a boxplot of the data. 


Solution: 


Exercise: 


Problem: The middle 50% of the data are between and 


Solution: 


1 and 4 


The next three questions refer to the following table: Seventy 5th and 
6th graders were asked their favorite dinner. 


Fried 
Pizza Hamburgers Spaghetti shrimp 


oth 


15 6 9 0 
grader 
6th 15 7 10 8 
grader 
Exercise: 
Problem: 


Find the probability that one randomly chosen child is in the 6th grade 
and prefers fried shrimp. 


Solution: 


D 


Exercise: 


Problem: Find the probability that a child does not prefer pizza. 


Solution: 


C 
Exercise: 


Problem: 


Find the probability a child is in the 5th grade given that the child 
prefers spaghetti. 


Solution: 
A 
Exercise: 
Problem: A sample of convenience is a random sample. 


e Atrue 


e Bfalse 


Solution: 
B 
Exercise: 
Problem: A statistic is a number that is a property of the population. 


e Atrue 
e Bfalse 


Solution: 


B 


Exercise: 


Problem: You should always throw out any data that are outliers. 


e Atrue 
e Bfalse 


Solution: 


B 
Exercise: 


Problem: 


Lee bakes pies for a small restaurant in Felton, CA. She generally 
bakes 20 pies in a day, on the average. Of interest is the num.ber of 
pies she bakes each day 


e aDefine the Random Variable 


e bState the distribution for 
e cFind the probability that Lee bakes more than 25 pies in any 
given day. 


Solution: 


° b 
e c0.1122 


Exercise: 
Problem: 
Six different brands of Italian salad dressing were randomly selected at 
a supermarket. The grams of fat per serving are 7, 7, 9, 6, 8, 5. Assume 
that the underlying distribution is normal. Calculate a 95% confidence 


interval for the population mean grams of fat per serving of Italian 
salad dressing sold in supermarkets. 


Solution: 


CI: 
Exercise: 


Problem: 


Given: uniform, exponential, normal distributions. Match each to a 
statement below. 


e amean = median ~ mode 
e bmean > median > mode 
e cmean = median = mode 


Solution: 


e auniform 


e bexponential 
¢ cnormal 


